Reinforcement learning

Results: 1147



#Item
901 Game theory / Cybernetics / Machine learning / Search algorithms / Learning / Reinforcement learning / Markov decision process / Multi-armed bandit / Algorithm / Statistics / Mathematics / Applied mathematics

Hedged learning: Regret-minimization with learning experts Yu-Han Chang [removed] CSAIL, Massachusetts Institute of Technology, 32 Vassar Street, Cambridge, MA[removed]USA Leslie Pack Kaelbling

Source URL: www.machinelearning.org

Language: English - Date: 2008-12-01 11:15:56
902 Ballistics / Aerodynamics / Mechanics / Trajectory / Aerobatics / Reinforcement learning / Stall / Flight / Aerospace engineering / Motion

Parameterized Maneuver Learning for Autonomous Helicopter Flight Jie Tang, Arjun Singh, Nimbus Goehausen, and Pieter Abbeel Abstract— Many robotic control tasks involve complex dynamics that are hard to model. Hand-spe

Source URL: www.cs.berkeley.edu

Language: English - Date: 2010-06-09 03:17:59
903 Control theory / Linear filters / Stochastic differential equations / Kalman filter / Markov decision process / Normal distribution / Gaussian process / Q-learning / SARSA / Statistics / Markov models / Stochastic processes

Reinforcement learning with Gaussian processes Yaakov Engel Dept. of Computing Science, University of Alberta, Edmonton, Canada Shie Mannor Dept. of Electrical and Computer Engineering, McGill University, Montreal, Cana

Source URL: www.machinelearning.org

Language: English - Date: 2008-12-01 11:15:01
904 Multi-agent systems / Reinforcement learning / Q-learning / Agent-based model / Action selection / Affect / Machine learning / Intelligent agent / Artificial intelligence / Science / Mind

Dynamic Analysis of Multiagent Q-learning with ε-greedy Exploration Eduardo Rodrigues Gomes

Source URL: www.machinelearning.org

Language: English - Date: 2009-05-18 12:17:09
905 Learning / Netflix / Algorithm / Supervised learning / Recommender system / Reinforcement learning / Cluster analysis / Overfitting / Concept learning / Statistics / Machine learning / Artificial intelligence

Yaser S. Abu-Mostafa is a professor of electrical engineering and computer science at the California Institute of Technology. ARTIFICIAL INTELLIGENCE

Source URL: work.caltech.edu

Language: English - Date: 2012-07-11 00:45:11
906 Dynamic programming / Markov processes / Stochastic control / Operations research / Mathematical optimization / Markov decision process / Q-learning / Reinforcement learning / Convex hull / Mathematics / Algebra / Statistics

Learning All Optimal Policies with Multiple Criteria Leon Barrett Srini Narayanan 1947 Center St. Ste. 600, Berkeley, CA 94704

Source URL: www.machinelearning.org

Language: English - Date: 2008-05-22 03:19:26
907 Supervised learning / Cluster analysis / Bayesian network / Regression analysis / Reinforcement learning / Hidden Markov model / Michael I. Jordan / Statistical classification / Semi-supervised learning / Statistics / Machine learning / Artificial intelligence

Table of Contents: Preface, Organization

Source URL: www.icml2010.org

Language: English - Date: 2010-07-25 05:16:19
908 Estimation theory / Statistical theory / Dynamic programming / Markov decision process / Reinforcement learning / Markov chain / Maximum likelihood / XTR / Statistics / Markov processes / Markov models

Exploration and Apprenticeship Learning in Reinforcement Learning Pieter Abbeel Andrew Y. Ng Computer Science Department, Stanford University Stanford, CA 94305, USA

Source URL: www.machinelearning.org

Language: English - Date: 2008-12-01 11:16:12
909 Ensemble learning / Operations research / Reinforcement learning / Gradient boosting / Boosting / Regression analysis / Mathematical optimization / Function / Supervised learning / Machine learning / Mathematics / Statistics

Non-Parametric Policy Gradients: A Unified Treatment of Propositional and Relational Domains Kristian Kersting [removed] Dept. of Knowledge Discovery, Fraunhofer IAIS, Schloss Birlinghoven, 537

Source URL: www.machinelearning.org

Language: English - Date: 2008-05-02 08:01:38
910 Preconditioner / Statistics / Reinforcement learning / Iterative method / Markov chain / Matrix / Sparse matrix / Applied mathematics / Markov models / Numerical linear algebra / Mathematics

Preconditioned Temporal Difference Learning Hengshuai Yao Zhi-Qiang Liu School of Creative Media, City University of Hong Kong, Hong Kong, China

Source URL: www.machinelearning.org

Language: English - Date: 2008-05-23 03:34:40